NSF PAR Search | NSF Public Access Repository

Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games

Yang, Tong; Dai, Bo; Xiao, Lin; Chi, Yuejie (July 2025, PMLR)

Multi-agent reinforcement learning (MARL) lies at the heart of many applications in which a group of agents interacts in a shared, unknown environment. A prominent framework for studying MARL is Markov games, where the goal is to find various notions of equilibria, such as the Nash equilibrium (NE) and the coarse correlated equilibrium (CCE), in a sample-efficient manner. However, existing sample-efficient approaches require either tailored uncertainty estimation under function approximation or careful coordination of the players. In this paper, we propose a novel model-based algorithm, called VMG, that incentivizes exploration by biasing the empirical estimate of the model parameters towards those with higher collective best-response values of all the players when fixing the other players’ policies, thus encouraging the policy to deviate from its current equilibrium for more exploration. VMG is oblivious to different forms of function approximation and permits simultaneous and uncoupled policy updates of all players. Theoretically, we also establish that VMG achieves near-optimal regret for finding both the NEs of two-player zero-sum Markov games and the CCEs of multi-player general-sum Markov games under linear function approximation in an online environment, nearly matching counterparts that rely on sophisticated uncertainty quantification.
Free, publicly-accessible full text available July 13, 2026
Vertical Federated Learning with Missing Features During Training and Inference

Valdeira, Pedro; Wang, Shiqiang; Chi, Yuejie (April 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
A theoretical analysis of self-supervised learning for vision transformers

Huang, Yu; Wen, Zixin; Chi, Yuejie; Liang, Yingbin (April 2025, International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 24, 2026
A Theoretical Analysis of Self-Supervised Learning for Vision Transformers

Huang, Yu; Wen, Zixin; Chi, Yuejie; Liang, Yingbin (April 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information

https://doi.org/10.1109/ICASSP49660.2025.10889036

Efimov, Timofey; Dong, Harry; Shah, Megna; Simmons, Jeff; Donegan, Sean; Chi, Yuejie (April 2025, IEEE)

Free, publicly-accessible full text available April 6, 2026
Convergence and Privacy of Decentralized Nonconvex Optimization With Gradient Clipping and Communication Compression

https://doi.org/10.1109/JSTSP.2025.3526081

Li, Boyue; Chi, Yuejie (January 2025, IEEE Journal of Selected Topics in Signal Processing)

Full Text Available
Communication-Efficient Federated Optimization Over Semi-Decentralized Networks

https://doi.org/10.1109/TSIPN.2025.3539004

Wang, He; Chi, Yuejie (January 2025, IEEE Transactions on Signal and Information Processing over Networks)

Full Text Available
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Gu, Shangding; Shi, Laixi; Wen, Muning; Jin, Ming; Mazumdar, Eric; Chi, Yuejie; Wierman, Adam; Spanos, Costas (April 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction

Xu, Xingyu; Chi, Yuejie (December 2024, The Thirty-eighth Annual Conference on Neural Information Processing Systems)

Full Text Available
The Sample-Communication Complexity Trade-off in Federated Q-Learning

Salgia, Sudeep; Chi, Yuejie (December 2024, 38th Conference on Neural Information Processing Systems)

Full Text Available
